Overview

Dataset statistics

Number of variables: 18
Number of observations: 2000
Missing cells: 35
Missing cells (%): 0.1%
Duplicate rows: 59
Duplicate rows (%): 2.9%
Total size in memory: 281.4 KiB
Average record size in memory: 144.1 B
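
The percentages and the average record size can be rechecked from the counts in this list. The sketch below assumes that 1 KiB is 1024 bytes, that the missing-cell share is taken over all cells (variables × observations), and that the average record size is total memory divided by the number of observations; the variable names are illustrative.

```python
# Figures copied from the dataset statistics in this report.
n_variables = 18
n_observations = 2000
missing_cells = 35
duplicate_rows = 59
total_size_kib = 281.4

# Assumed definitions (not stated in the report itself).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells:      {total_cells}")
print(f"missing cells:    {missing_pct:.3f}%")
print(f"duplicate rows:   {duplicate_pct:.2f}%")
print(f"avg record size:  {avg_record_bytes:.4f} B")
```

Under these assumptions the missing share is just under 0.1%, the duplicate share is about 2.95% (shown as 2.9% in the report), and the average record size rounds to 144.1 B.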

Variable types

Unsupported: 2
Numeric: 13
Categorical: 3

Alerts

Dataset has 59 (2.9%) duplicate rows [Duplicates]
genre has a high cardinality: 59 distinct values [High cardinality]
explicit is highly correlated with speechiness and 1 other field [High correlation]
genre is highly correlated with explicit and 3 other fields [High correlation]
energy is highly correlated with loudness and 2 other fields [High correlation]
loudness is highly correlated with energy and 1 other field [High correlation]
speechiness is highly correlated with explicit [High correlation]
acousticness is highly correlated with energy [High correlation]
instrumentalness is highly correlated with genre [High correlation]
artist is an unsupported type; check if it needs cleaning or further analysis [Unsupported]
song is an unsupported type; check if it needs cleaning or further analysis [Unsupported]
popularity has 126 (6.3%) zeros [Zeros]
key has 198 (9.9%) zeros [Zeros]
instrumentalness has 1085 (54.2%) zeros [Zeros]

Reproduction

Analysis started: 2022-09-12 03:08:09.767588
Analysis finished: 2022-09-12 03:08:43.648486
Duration: 33.88 seconds
Software version: pandas-profiling v3.3.0
Download configuration: config.json

Variables

artist
Unsupported

REJECTED
UNSUPPORTED

Missing: 0
Missing (%): 0.0%
Memory size: 15.8 KiB

song
Unsupported

REJECTED
UNSUPPORTED

Missing: 0
Missing (%): 0.0%
Memory size: 15.8 KiB

duration_ms
Real number (ℝ≥0)

Distinct: 1790
Distinct (%): 89.6%
Missing: 3
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 228750.1627
Minimum: 113000
Maximum: 484146
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 15.8 KiB

Quantile statistics

Minimum: 113000
5-th percentile: 174954.6
Q1: 203520
Median: 223266
Q3: 248133
95-th percentile: 298870.4
Maximum: 484146
Range: 371146
Interquartile range (IQR): 44613

Descriptive statistics

Standard deviation: 39164.93365
Coefficient of variation (CV): 0.1712127029
Kurtosis: 3.305887399
Mean: 228750.1627
Median Absolute Deviation (MAD): 21680
Skewness: 1.01809728
Sum: 456814075
Variance: 1533892028
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
212106 | 4 | 0.2%
240040 | 3 | 0.1%
200106 | 3 | 0.1%
202066 | 3 | 0.1%
243533 | 3 | 0.1%
249533 | 3 | 0.1%
268866 | 3 | 0.1%
236133 | 3 | 0.1%
199480 | 3 | 0.1%
257200 | 3 | 0.1%
Other values (1780) | 1966 | 98.3%

Value | Count | Frequency (%)
113000 | 1 | 0.1%
114893 | 1 | 0.1%
119133 | 1 | 0.1%
121886 | 1 | 0.1%
124055 | 1 | 0.1%
126446 | 1 | 0.1%
127920 | 1 | 0.1%
129264 | 1 | 0.1%
131064 | 1 | 0.1%
131213 | 1 | 0.1%

Value | Count | Frequency (%)
484146 | 1 | 0.1%
452906 | 1 | 0.1%
448573 | 1 | 0.1%
444333 | 1 | 0.1%
432146 | 1 | 0.1%
417920 | 1 | 0.1%
404106 | 1 | 0.1%
393813 | 1 | 0.1%
366733 | 1 | 0.1%
359973 | 1 | 0.1%
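
Several figures in the duration_ms summary above are derived from the others, and the relationships can be rechecked in a few lines of Python. This is a sketch that assumes the conventional definitions (range = maximum - minimum, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared, mean = sum / non-missing count); the variable names are illustrative.

```python
# Figures copied from the duration_ms section of this report.
minimum, maximum = 113000, 484146
q1, q3 = 203520, 248133
mean = 228750.1627
std = 39164.93365
total = 456814075
n_non_missing = 2000 - 3  # observations minus missing values

# Derived quantities under the assumed definitions.
value_range = maximum - minimum
iqr = q3 - q1
cv = std / mean
variance = std ** 2
mean_from_sum = total / n_non_missing

print(value_range, iqr)
print(round(cv, 6), round(variance), round(mean_from_sum, 4))
```

The recomputed values agree with the reported range (371146), IQR (44613), CV, variance and mean to the precision shown in the report.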

explicit
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.1%
Missing: 2
Missing (%): 0.1%
Memory size: 15.8 KiB
0.0: 1447
1.0: 551

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 5994
Distinct characters: 3
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
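
The character counts reported for this variable can be reproduced with Python's standard `unicodedata` module. The sketch assumes the column values are rendered as the strings "0.0" and "1.0" with the counts shown above; `Nd` and `Po` are the Unicode general-category codes for Decimal Number and Other Punctuation.

```python
import unicodedata
from collections import Counter

# Counts taken from this report: 1447 values "0.0" and 551 values "1.0".
values = ["0.0"] * 1447 + ["1.0"] * 551

# Count individual characters and their Unicode general categories.
char_counts = Counter(ch for value in values for ch in value)
category_counts = Counter(
    unicodedata.category(ch) for value in values for ch in value
)
total_characters = sum(char_counts.values())

print(total_characters)
print(char_counts.most_common())
print(category_counts.most_common())
```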

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0.0
2nd row: 0.0
3rd row: 0.0
4th row: 0.0
5th row: 0.0

Common Values

Value | Count | Frequency (%)
0.0 | 1447 | 72.4%
1.0 | 551 | 27.6%
(Missing) | 2 | 0.1%

Length

Histogram of lengths of the category

Category Frequency Plot

Value | Count | Frequency (%)
0.0 | 1447 | 72.4%
1.0 | 551 | 27.6%

Most occurring characters

Value | Count | Frequency (%)
0 | 3445 | 57.5%
. | 1998 | 33.3%
1 | 551 | 9.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 3996 | 66.7%
Other Punctuation | 1998 | 33.3%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 3445 | 86.2%
1 | 551 | 13.8%
Other Punctuation
Value | Count | Frequency (%)
. | 1998 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 5994 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 3445 | 57.5%
. | 1998 | 33.3%
1 | 551 | 9.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 5994 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 3445 | 57.5%
. | 1998 | 33.3%
1 | 551 | 9.2%

year
Real number (ℝ≥0)

Distinct: 23
Distinct (%): 1.2%
Missing: 1
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 2009.490245
Minimum: 1998
Maximum: 2020
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 15.8 KiB

Quantile statistics

Minimum: 1998
5-th percentile: 2000
Q1: 2004
Median: 2010
Q3: 2015
95-th percentile: 2018
Maximum: 2020
Range: 22
Interquartile range (IQR): 11

Descriptive statistics

Standard deviation: 5.859019372
Coefficient of variation (CV): 0.002915674453
Kurtosis: -1.194757502
Mean: 2009.490245
Median Absolute Deviation (MAD): 5
Skewness: -0.04537927933
Sum: 4016971
Variance: 34.328108
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=23)
Value | Count | Frequency (%)
2012 | 115 | 5.8%
2017 | 110 | 5.5%
2001 | 108 | 5.4%
2010 | 107 | 5.3%
2018 | 107 | 5.3%
2014 | 104 | 5.2%
2005 | 104 | 5.2%
2011 | 99 | 5.0%
2016 | 99 | 5.0%
2015 | 99 | 5.0%
Other values (13) | 947 | 47.3%

Value | Count | Frequency (%)
1998 | 1 | 0.1%
1999 | 38 | 1.9%
2000 | 74 | 3.7%
2001 | 108 | 5.4%
2002 | 90 | 4.5%
2003 | 97 | 4.9%
2004 | 96 | 4.8%
2005 | 104 | 5.2%
2006 | 95 | 4.8%
2007 | 94 | 4.7%

Value | Count | Frequency (%)
2020 | 3 | 0.1%
2019 | 89 | 4.5%
2018 | 107 | 5.3%
2017 | 110 | 5.5%
2016 | 99 | 5.0%
2015 | 99 | 5.0%
2014 | 104 | 5.2%
2013 | 89 | 4.5%
2012 | 115 | 5.8%
2011 | 99 | 5.0%

popularity
Real number (ℝ≥0)

ZEROS

Distinct: 76
Distinct (%): 3.8%
Missing: 4
Missing (%): 0.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 59.85220441
Minimum: 0
Maximum: 89
Zeros: 126
Zeros (%): 6.3%
Negative: 0
Negative (%): 0.0%
Memory size: 15.8 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 56
Median: 65
Q3: 73
95-th percentile: 80
Maximum: 89
Range: 89
Interquartile range (IQR): 17

Descriptive statistics

Standard deviation: 21.35163767
Coefficient of variation (CV): 0.3567393696
Kurtosis: 2.645965134
Mean: 59.85220441
Median Absolute Deviation (MAD): 8
Skewness: -1.821383915
Sum: 119465
Variance: 455.8924312
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 126 | 6.3%
69 | 76 | 3.8%
68 | 75 | 3.8%
73 | 69 | 3.5%
74 | 68 | 3.4%
67 | 66 | 3.3%
76 | 64 | 3.2%
64 | 63 | 3.1%
72 | 62 | 3.1%
57 | 62 | 3.1%
Other values (66) | 1265 | 63.2%

Value | Count | Frequency (%)
0 | 126 | 6.3%
1 | 31 | 1.6%
2 | 11 | 0.5%
3 | 5 | 0.2%
4 | 4 | 0.2%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
11 | 1 | 0.1%
16 | 1 | 0.1%

Value | Count | Frequency (%)
89 | 1 | 0.1%
88 | 1 | 0.1%
87 | 1 | 0.1%
86 | 4 | 0.2%
85 | 7 | 0.4%
84 | 11 | 0.5%
83 | 15 | 0.8%
82 | 25 | 1.2%
81 | 27 | 1.4%
80 | 43 | 2.1%

danceability
Real number (ℝ≥0)

Distinct: 565
Distinct (%): 28.3%
Missing: 4
Missing (%): 0.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.6673446894
Minimum: 0.129
Maximum: 0.975
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 15.8 KiB

Quantile statistics

Minimum: 0.129
5-th percentile: 0.4195
Q1: 0.581
Median: 0.676
Q3: 0.764
95-th percentile: 0.886
Maximum: 0.975
Range: 0.846
Interquartile range (IQR): 0.183

Descriptive statistics

Standard deviation: 0.1405145447
Coefficient of variation (CV): 0.2105576727
Kurtosis: 0.1214117439
Mean: 0.6673446894
Median Absolute Deviation (MAD): 0.091
Skewness: -0.4265341423
Sum: 1332.02
Variance: 0.01974433727
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0.688 | 12 | 0.6%
0.736 | 12 | 0.6%
0.687 | 11 | 0.5%
0.791 | 11 | 0.5%
0.682 | 11 | 0.5%
0.656 | 10 | 0.5%
0.76 | 10 | 0.5%
0.794 | 10 | 0.5%
0.66 | 10 | 0.5%
0.68 | 10 | 0.5%
Other values (555) | 1889 | 94.5%

Value | Count | Frequency (%)
0.129 | 1 | 0.1%
0.177 | 1 | 0.1%
0.179 | 1 | 0.1%
0.18 | 1 | 0.1%
0.19 | 1 | 0.1%
0.209 | 1 | 0.1%
0.217 | 1 | 0.1%
0.23 | 1 | 0.1%
0.256 | 1 | 0.1%
0.259 | 1 | 0.1%

Value | Count | Frequency (%)
0.975 | 1 | 0.1%
0.97 | 1 | 0.1%
0.969 | 1 | 0.1%
0.967 | 1 | 0.1%
0.964 | 2 | 0.1%
0.963 | 2 | 0.1%
0.962 | 1 | 0.1%
0.956 | 1 | 0.1%
0.955 | 1 | 0.1%
0.951 | 1 | 0.1%

energy
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 580
Distinct (%): 29.0%
Missing: 1
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.7202966483
Minimum: 0.0549
Maximum: 0.999
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 15.8 KiB

Quantile statistics

Minimum: 0.0549
5-th percentile: 0.4429
Q1: 0.622
Median: 0.736
Q3: 0.839
95-th percentile: 0.9331
Maximum: 0.999
Range: 0.9441
Interquartile range (IQR): 0.217

Descriptive statistics

Standard deviation: 0.152752005
Coefficient of variation (CV): 0.2120681879
Kurtosis: 0.1759961291
Mean: 0.7202966483
Median Absolute Deviation (MAD): 0.108
Skewness: -0.632140422
Sum: 1439.873
Variance: 0.02333317503
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0.783 | 15 | 0.8%
0.862 | 12 | 0.6%
0.768 | 11 | 0.5%
0.8 | 10 | 0.5%
0.791 | 10 | 0.5%
0.628 | 9 | 0.4%
0.861 | 9 | 0.4%
0.786 | 9 | 0.4%
0.921 | 9 | 0.4%
0.677 | 9 | 0.4%
Other values (570) | 1896 | 94.8%

Value | Count | Frequency (%)
0.0549 | 1 | 0.1%
0.0581 | 1 | 0.1%
0.115 | 1 | 0.1%
0.203 | 1 | 0.1%
0.219 | 1 | 0.1%
0.247 | 1 | 0.1%
0.249 | 1 | 0.1%
0.261 | 1 | 0.1%
0.264 | 1 | 0.1%
0.265 | 1 | 0.1%

Value | Count | Frequency (%)
0.999 | 1 | 0.1%
0.988 | 1 | 0.1%
0.985 | 1 | 0.1%
0.984 | 1 | 0.1%
0.982 | 1 | 0.1%
0.981 | 1 | 0.1%
0.979 | 1 | 0.1%
0.978 | 1 | 0.1%
0.977 | 2 | 0.1%
0.976 | 3 | 0.1%

key
Real number (ℝ≥0)

ZEROS

Distinct: 12
Distinct (%): 0.6%
Missing: 1
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.378189095
Minimum: 0
Maximum: 11
Zeros: 198
Zeros (%): 9.9%
Negative: 0
Negative (%): 0.0%
Memory size: 15.8 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 2
Median: 6
Q3: 8
95-th percentile: 11
Maximum: 11
Range: 11
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 3.61595349
Coefficient of variation (CV): 0.6723366223
Kurtosis: -1.298666178
Mean: 5.378189095
Median Absolute Deviation (MAD): 3
Skewness: -0.009533483866
Sum: 10751
Variance: 13.07511964
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=12)
Value | Count | Frequency (%)
1 | 267 | 13.4%
11 | 199 | 10.0%
0 | 198 | 9.9%
7 | 197 | 9.8%
5 | 181 | 9.0%
8 | 173 | 8.6%
2 | 158 | 7.9%
9 | 157 | 7.8%
6 | 154 | 7.7%
10 | 129 | 6.5%
Other values (2) | 186 | 9.3%

Value | Count | Frequency (%)
0 | 198 | 9.9%
1 | 267 | 13.4%
2 | 158 | 7.9%
3 | 60 | 3.0%
4 | 126 | 6.3%
5 | 181 | 9.0%
6 | 154 | 7.7%
7 | 197 | 9.8%
8 | 173 | 8.6%
9 | 157 | 7.8%

Value | Count | Frequency (%)
11 | 199 | 10.0%
10 | 129 | 6.5%
9 | 157 | 7.8%
8 | 173 | 8.6%
7 | 197 | 9.8%
6 | 154 | 7.7%
5 | 181 | 9.0%
4 | 126 | 6.3%
3 | 60 | 3.0%
2 | 158 | 7.9%

loudness
Real number (ℝ)

HIGH CORRELATION

Distinct: 1669
Distinct (%): 83.5%
Missing: 2
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: -4992.963732
Minimum: -20514
Maximum: -0.276
Zeros: 0
Zeros (%): 0.0%
Negative: 1998
Negative (%): 99.9%
Memory size: 15.8 KiB

Quantile statistics

Minimum: -20514
5-th percentile: -8728.2
Q1: -6336
Median: -5076.5
Q3: -3836.75
95-th percentile: -5.3455
Maximum: -0.276
Range: 20513.724
Interquartile range (IQR): 2499.25

Descriptive statistics

Standard deviation: 2429.823923
Coefficient of variation (CV): -0.486649624
Kurtosis: 1.821564258
Mean: -4992.963732
Median Absolute Deviation (MAD): 1245.5
Skewness: -0.08886108204
Sum: -9975941.536
Variance: 5904044.296
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
-6366 | 5 | 0.2%
-5595 | 4 | 0.2%
-3782 | 4 | 0.2%
-5153 | 4 | 0.2%
-5096 | 3 | 0.1%
-3078 | 3 | 0.1%
-4315 | 3 | 0.1%
-5611 | 3 | 0.1%
-5892 | 3 | 0.1%
-3887 | 3 | 0.1%
Other values (1659) | 1963 | 98.2%

Value | Count | Frequency (%)
-20514 | 1 | 0.1%
-17217 | 1 | 0.1%
-15636 | 1 | 0.1%
-14505 | 1 | 0.1%
-13964 | 1 | 0.1%
-13744 | 1 | 0.1%
-13609 | 1 | 0.1%
-13203 | 1 | 0.1%
-12932 | 1 | 0.1%
-12852 | 1 | 0.1%

Value | Count | Frequency (%)
-0.276 | 1 | 0.1%
-0.74 | 1 | 0.1%
-1.19 | 1 | 0.1%
-1.73 | 1 | 0.1%
-2.18 | 2 | 0.1%
-2.28 | 1 | 0.1%
-2.36 | 1 | 0.1%
-2.61 | 1 | 0.1%
-2.69 | 1 | 0.1%
-2.76 | 1 | 0.1%

mode
Categorical

Distinct: 2
Distinct (%): 0.1%
Missing: 1
Missing (%): < 0.1%
Memory size: 15.8 KiB
1.0: 1107
0.0: 892

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 5997
Distinct characters: 3
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0.0
2nd row: 1.0
3rd row: 1.0
4th row: 0.0
5th row: 0.0

Common Values

Value | Count | Frequency (%)
1.0 | 1107 | 55.4%
0.0 | 892 | 44.6%
(Missing) | 1 | 0.1%

Length

Histogram of lengths of the category

Category Frequency Plot

Value | Count | Frequency (%)
1.0 | 1107 | 55.4%
0.0 | 892 | 44.6%

Most occurring characters

Value | Count | Frequency (%)
0 | 2891 | 48.2%
. | 1999 | 33.3%
1 | 1107 | 18.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 3998 | 66.7%
Other Punctuation | 1999 | 33.3%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 2891 | 72.3%
1 | 1107 | 27.7%
Other Punctuation
Value | Count | Frequency (%)
. | 1999 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 5997 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 2891 | 48.2%
. | 1999 | 33.3%
1 | 1107 | 18.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 5997 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 2891 | 48.2%
. | 1999 | 33.3%
1 | 1107 | 18.5%

speechiness
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 836
Distinct (%): 41.8%
Missing: 2
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.1036066066
Minimum: 0.0232
Maximum: 0.576
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 15.8 KiB

Quantile statistics

Minimum: 0.0232
5-th percentile: 0.029
Q1: 0.0396
Median: 0.05985
Q3: 0.129
95-th percentile: 0.322
Maximum: 0.576
Range: 0.5528
Interquartile range (IQR): 0.0894

Descriptive statistics

Standard deviation: 0.09619269207
Coefficient of variation (CV): 0.9284416817
Kurtosis: 2.620774903
Mean: 0.1036066066
Median Absolute Deviation (MAD): 0.02645
Skewness: 1.760789253
Sum: 207.006
Variance: 0.009253034007
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0.0432 | 13 | 0.7%
0.029 | 12 | 0.6%
0.0322 | 11 | 0.5%
0.0363 | 10 | 0.5%
0.109 | 9 | 0.4%
0.0439 | 9 | 0.4%
0.0377 | 8 | 0.4%
0.108 | 8 | 0.4%
0.046 | 8 | 0.4%
0.0431 | 8 | 0.4%
Other values (826) | 1902 | 95.1%

Value | Count | Frequency (%)
0.0232 | 1 | 0.1%
0.0239 | 1 | 0.1%
0.0241 | 1 | 0.1%
0.0242 | 1 | 0.1%
0.0245 | 2 | 0.1%
0.0247 | 1 | 0.1%
0.0249 | 2 | 0.1%
0.0252 | 2 | 0.1%
0.0253 | 2 | 0.1%
0.0255 | 2 | 0.1%

Value | Count | Frequency (%)
0.576 | 1 | 0.1%
0.53 | 1 | 0.1%
0.516 | 1 | 0.1%
0.505 | 1 | 0.1%
0.488 | 1 | 0.1%
0.484 | 1 | 0.1%
0.483 | 1 | 0.1%
0.478 | 1 | 0.1%
0.47 | 1 | 0.1%
0.467 | 1 | 0.1%

acousticness
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 1208
Distinct (%): 60.5%
Missing: 4
Missing (%): 0.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.1291619519
Minimum: 1.92 × 10^-5
Maximum: 0.976
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 15.8 KiB

Quantile statistics

Minimum: 1.92 × 10^-5
5-th percentile: 0.0009825
Q1: 0.013925
Median: 0.05585
Q3: 0.177
95-th percentile: 0.515
Maximum: 0.976
Range: 0.9759808
Interquartile range (IQR): 0.163075

Descriptive statistics

Standard deviation: 0.1734572072
Coefficient of variation (CV): 1.342943527
Kurtosis: 4.651408522
Mean: 0.1291619519
Median Absolute Deviation (MAD): 0.05115
Skewness: 2.091152906
Sum: 257.8072559
Variance: 0.03008740272
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0.15 | 7 | 0.4%
0.107 | 7 | 0.4%
0.23 | 7 | 0.4%
0.191 | 6 | 0.3%
0.157 | 6 | 0.3%
0.106 | 6 | 0.3%
0.109 | 6 | 0.3%
0.123 | 6 | 0.3%
0.0192 | 6 | 0.3%
0.102 | 6 | 0.3%
Other values (1198) | 1933 | 96.7%

Value | Count | Frequency (%)
1.92 × 10^-5 | 1 | 0.1%
2.06 × 10^-5 | 1 | 0.1%
2.64 × 10^-5 | 1 | 0.1%
3.82 × 10^-5 | 1 | 0.1%
4.14 × 10^-5 | 1 | 0.1%
5.15 × 10^-5 | 1 | 0.1%
5.48 × 10^-5 | 1 | 0.1%
6.52 × 10^-5 | 1 | 0.1%
6.79 × 10^-5 | 1 | 0.1%
7.95 × 10^-5 | 1 | 0.1%

Value | Count | Frequency (%)
0.976 | 1 | 0.1%
0.966 | 1 | 0.1%
0.953 | 1 | 0.1%
0.945 | 1 | 0.1%
0.934 | 2 | 0.1%
0.932 | 1 | 0.1%
0.922 | 1 | 0.1%
0.896 | 1 | 0.1%
0.893 | 1 | 0.1%
0.883 | 1 | 0.1%

instrumentalness
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 772
Distinct (%): 38.6%
Missing: 2
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.01524123184
Minimum: 0
Maximum: 0.985
Zeros: 1085
Zeros (%): 54.2%
Negative: 0
Negative (%): 0.0%
Memory size: 15.8 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 6.89 × 10^-5
95-th percentile: 0.034515
Maximum: 0.985
Range: 0.985
Interquartile range (IQR): 6.89 × 10^-5

Descriptive statistics

Standard deviation: 0.08781333857
Coefficient of variation (CV): 5.761564388
Kurtosis: 61.40423073
Mean: 0.01524123184
Median Absolute Deviation (MAD): 0
Skewness: 7.577776056
Sum: 30.45198121
Variance: 0.007711182431
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 1085 | 54.2%
0.0013 | 3 | 0.1%
0.000108 | 3 | 0.1%
0.000139 | 3 | 0.1%
0.000113 | 3 | 0.1%
8.83 × 10^-6 | 3 | 0.1%
1.96 × 10^-6 | 3 | 0.1%
2.77 × 10^-6 | 3 | 0.1%
0.000157 | 3 | 0.1%
1.81 × 10^-6 | 3 | 0.1%
Other values (762) | 886 | 44.3%

Value | Count | Frequency (%)
0 | 1085 | 54.2%
1.01 × 10^-6 | 1 | 0.1%
1.03 × 10^-6 | 3 | 0.1%
1.04 × 10^-6 | 1 | 0.1%
1.07 × 10^-6 | 1 | 0.1%
1.1 × 10^-6 | 1 | 0.1%
1.11 × 10^-6 | 3 | 0.1%
1.13 × 10^-6 | 1 | 0.1%
1.16 × 10^-6 | 2 | 0.1%
1.2 × 10^-6 | 1 | 0.1%

Value | Count | Frequency (%)
0.985 | 1 | 0.1%
0.925 | 1 | 0.1%
0.901 | 1 | 0.1%
0.894 | 1 | 0.1%
0.828 | 1 | 0.1%
0.812 | 1 | 0.1%
0.809 | 1 | 0.1%
0.799 | 1 | 0.1%
0.792 | 1 | 0.1%
0.751 | 1 | 0.1%

liveness
Real number (ℝ≥0)

Distinct: 782
Distinct (%): 39.1%
Missing: 1
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.1812956978
Minimum: 0.0234
Maximum: 0.853
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 15.8 KiB

Quantile statistics

Minimum: 0.0234
5-th percentile: 0.05169
Q1: 0.08815
Median: 0.124
Q3: 0.241
95-th percentile: 0.4606
Maximum: 0.853
Range: 0.8296
Interquartile range (IQR): 0.15285

Descriptive statistics

Standard deviation: 0.1406590055
Coefficient of variation (CV): 0.7758540723
Kurtosis: 3.831954855
Mean: 0.1812956978
Median Absolute Deviation (MAD): 0.0504
Skewness: 1.848935403
Sum: 362.4101
Variance: 0.01978495582
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0.104 | 25 | 1.2%
0.111 | 23 | 1.1%
0.107 | 20 | 1.0%
0.108 | 19 | 0.9%
0.118 | 19 | 0.9%
0.112 | 18 | 0.9%
0.106 | 17 | 0.9%
0.105 | 16 | 0.8%
0.101 | 15 | 0.8%
0.124 | 15 | 0.8%
Other values (772) | 1812 | 90.6%

Value | Count | Frequency (%)
0.0234 | 1 | 0.1%
0.0241 | 1 | 0.1%
0.0263 | 1 | 0.1%
0.0272 | 1 | 0.1%
0.028 | 1 | 0.1%
0.0283 | 2 | 0.1%
0.0286 | 1 | 0.1%
0.0288 | 2 | 0.1%
0.029 | 1 | 0.1%
0.0304 | 1 | 0.1%

Value | Count | Frequency (%)
0.853 | 1 | 0.1%
0.843 | 1 | 0.1%
0.839 | 1 | 0.1%
0.833 | 1 | 0.1%
0.826 | 1 | 0.1%
0.82 | 1 | 0.1%
0.817 | 1 | 0.1%
0.801 | 1 | 0.1%
0.795 | 1 | 0.1%
0.775 | 1 | 0.1%

valence
Real number (ℝ≥0)

Distinct: 760
Distinct (%): 38.0%
Missing: 2
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.5515346847
Minimum: 0.0381
Maximum: 0.973
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 15.8 KiB

Quantile statistics

Minimum: 0.0381
5-th percentile: 0.1737
Q1: 0.38625
Median: 0.5575
Q3: 0.73
95-th percentile: 0.895
Maximum: 0.973
Range: 0.9349
Interquartile range (IQR): 0.34375

Descriptive statistics

Standard deviation: 0.2208111539
Coefficient of variation (CV): 0.4003576929
Kurtosis: -0.8221086212
Mean: 0.5515346847
Median Absolute Deviation (MAD): 0.1715
Skewness: -0.1293595051
Sum: 1101.9663
Variance: 0.0487575657
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0.418 | 11 | 0.5%
0.74 | 9 | 0.4%
0.527 | 8 | 0.4%
0.4 | 8 | 0.4%
0.801 | 8 | 0.4%
0.446 | 8 | 0.4%
0.272 | 7 | 0.4%
0.716 | 7 | 0.4%
0.554 | 7 | 0.4%
0.586 | 7 | 0.4%
Other values (750) | 1918 | 95.9%

Value | Count | Frequency (%)
0.0381 | 1 | 0.1%
0.0406 | 1 | 0.1%
0.0594 | 1 | 0.1%
0.0596 | 1 | 0.1%
0.0681 | 1 | 0.1%
0.0694 | 1 | 0.1%
0.0756 | 1 | 0.1%
0.0783 | 1 | 0.1%
0.0784 | 1 | 0.1%
0.0789 | 1 | 0.1%

Value | Count | Frequency (%)
0.973 | 2 | 0.1%
0.972 | 2 | 0.1%
0.969 | 1 | 0.1%
0.968 | 1 | 0.1%
0.966 | 4 | 0.2%
0.965 | 3 | 0.1%
0.964 | 4 | 0.2%
0.963 | 2 | 0.1%
0.962 | 5 | 0.2%
0.961 | 4 | 0.2%

tempo
Real number (ℝ≥0)

Distinct: 1828
Distinct (%): 91.5%
Missing: 3
Missing (%): 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 107236.0186
Minimum: 75.09
Maximum: 210851
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 15.8 KiB

Quantile statistics

Minimum: 75.09
5-th percentile: 117.248
Q1: 93023
Median: 116044
Q3: 131059
95-th percentile: 171373.4
Maximum: 210851
Range: 210775.91
Interquartile range (IQR): 38036

Descriptive statistics

Standard deviation: 45263.83167
Coefficient of variation (CV): 0.4220954138
Kurtosis: 1.023432492
Mean: 107236.0186
Median Absolute Deviation (MAD): 20201
Skewness: -1.037539384
Sum: 214150329.2
Variance: 2048814457
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
140022 | 4 | 0.2%
130 | 3 | 0.1%
100011 | 3 | 0.1%
97954 | 3 | 0.1%
120003 | 3 | 0.1%
91.03 | 3 | 0.1%
143994 | 3 | 0.1%
121996 | 3 | 0.1%
128008 | 3 | 0.1%
83066 | 3 | 0.1%
Other values (1818) | 1966 | 98.3%

Value | Count | Frequency (%)
75.09 | 1 | 0.1%
76 | 1 | 0.1%
76.91 | 1 | 0.1%
77.49 | 1 | 0.1%
79.01 | 1 | 0.1%
80 | 1 | 0.1%
82.48 | 1 | 0.1%
82.82 | 1 | 0.1%
82.92 | 1 | 0.1%
83.46 | 1 | 0.1%

Value | Count | Frequency (%)
210851 | 1 | 0.1%
203911 | 1 | 0.1%
203862 | 1 | 0.1%
202015 | 1 | 0.1%
201936 | 1 | 0.1%
199958 | 1 | 0.1%
199935 | 1 | 0.1%
199764 | 1 | 0.1%
198075 | 1 | 0.1%
198065 | 1 | 0.1%

genre
Categorical

HIGH CARDINALITY
HIGH CORRELATION

Distinct: 59
Distinct (%): 3.0%
Missing: 2
Missing (%): 0.1%
Memory size: 15.8 KiB
pop: 427
hip hop, pop: 277
hip hop, pop, R&B: 244
pop, Dance/Electronic: 221
pop, R&B: 178
Other values (54): 651

Length

Max length: 37
Median length: 32
Mean length: 11.88688689
Min length: 3

Characters and Unicode

Total characters: 23750
Distinct characters: 35
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 18
Unique (%): 0.9%

Sample

1st row: pop
2nd row: rock, pop
3rd row: pop, country
4th row: rock, metal
5th row: pop

Common Values

Value | Count | Frequency (%)
pop | 427 | 21.3%
hip hop, pop | 277 | 13.9%
hip hop, pop, R&B | 244 | 12.2%
pop, Dance/Electronic | 221 | 11.1%
pop, R&B | 178 | 8.9%
hip hop | 124 | 6.2%
hip hop, pop, Dance/Electronic | 78 | 3.9%
rock | 57 | 2.9%
rock, pop | 43 | 2.1%
Dance/Electronic | 41 | 2.1%
Other values (49) | 308 | 15.4%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
pop | 1632 | 36.4%
hip | 778 | 17.3%
hop | 778 | 17.3%
r&b | 452 | 10.1%
dance/electronic | 390 | 8.7%
rock | 233 | 5.2%
metal | 66 | 1.5%
latin | 64 | 1.4%
set | 22 | 0.5%
country | 21 | 0.5%
Other values (7) | 51 | 1.1%

Most occurring characters

Value | Count | Frequency (%)
p | 4820 | 20.3%
o | 3114 | 13.1%
(space) | 2489 | 10.5%
, | 1704 | 7.2%
h | 1556 | 6.6%
c | 1466 | 6.2%
i | 1287 | 5.4%
n | 889 | 3.7%
e | 886 | 3.7%
r | 664 | 2.8%
Other values (25) | 4875 | 20.5%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 16897 | 71.1%
Other Punctuation | 2576 | 10.8%
Space Separator | 2489 | 10.5%
Uppercase Letter | 1744 | 7.3%
Close Punctuation | 22 | 0.1%
Open Punctuation | 22 | 0.1%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
p | 4820 | 28.5%
o | 3114 | 18.4%
h | 1556 | 9.2%
c | 1466 | 8.7%
i | 1287 | 7.6%
n | 889 | 5.3%
e | 886 | 5.2%
r | 664 | 3.9%
t | 600 | 3.6%
l | 573 | 3.4%
Other values (11) | 1042 | 6.2%
Uppercase Letter
Value | Count | Frequency (%)
B | 452 | 25.9%
R | 452 | 25.9%
E | 390 | 22.4%
D | 390 | 22.4%
F | 20 | 1.1%
A | 20 | 1.1%
W | 10 | 0.6%
T | 10 | 0.6%
Other Punctuation
Value | Count | Frequency (%)
, | 1704 | 66.1%
& | 452 | 17.5%
/ | 420 | 16.3%
Space Separator
Value | Count | Frequency (%)
(space) | 2489 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 22 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 22 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 18641 | 78.5%
Common | 5109 | 21.5%

Most frequent character per script

Latin
Value | Count | Frequency (%)
p | 4820 | 25.9%
o | 3114 | 16.7%
h | 1556 | 8.3%
c | 1466 | 7.9%
i | 1287 | 6.9%
n | 889 | 4.8%
e | 886 | 4.8%
r | 664 | 3.6%
t | 600 | 3.2%
l | 573 | 3.1%
Other values (19) | 2786 | 14.9%
Common
Value | Count | Frequency (%)
(space) | 2489 | 48.7%
, | 1704 | 33.4%
& | 452 | 8.8%
/ | 420 | 8.2%
) | 22 | 0.4%
( | 22 | 0.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 23750 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
p | 4820 | 20.3%
o | 3114 | 13.1%
(space) | 2489 | 10.5%
, | 1704 | 7.2%
h | 1556 | 6.6%
c | 1466 | 6.2%
i | 1287 | 5.4%
n | 889 | 3.7%
e | 886 | 3.7%
r | 664 | 2.8%
Other values (25) | 4875 | 20.5%

Interactions

Correlations

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
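
The calculation just described can be sketched in plain Python. This is a minimal illustration, not the report software's implementation: it assumes tied values receive the average of their rank positions, and the helper names `average_ranks`, `pearson` and `spearman` are hypothetical.

```python
from math import sqrt

def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Pearson correlation applied to the rank variables of x and y."""
    return pearson(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]  # monotonic but nonlinear in x
rho = spearman(x, y)
r = pearson(x, y)
print(rho, r)
```

Because y increases whenever x does, ρ is 1 here even though the relationship is not linear, while Pearson's r on the raw values stays below 1.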

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear relationship the slope of the line (its angle to the x-axis) does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
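
As an illustration of this formula and of the invariance property, the sketch below computes r in plain Python and then shifts and rescales both variables. The data are made up for the example, the function name `pearson_r` is hypothetical, and the scale factors are assumed to be positive (a negative factor would flip the sign of r).

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Made-up, roughly linear data.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
r_original = pearson_r(x, y)

# Separate changes of location and (positive) scale for each variable.
x_shifted = [10 * a + 3 for a in x]
y_shifted = [0.5 * b - 7 for b in y]
r_transformed = pearson_r(x_shifted, y_shifted)

print(r_original, r_transformed)
```

Both calls print the same coefficient up to floating-point error.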

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
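
The pair-counting definition can be written out directly. The sketch below implements exactly that definition (often called tau-a) and makes no adjustment for ties, so it may differ from library implementations that use a tie-corrected variant; the function name `kendall_tau` and the sample data are illustrative.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant - discordant) / total pairs, with no tie correction."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1   # both variables move in the same direction
        elif product < 0:
            discordant += 1   # the variables move in opposite directions
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

x = [1, 2, 3, 4, 5]
y = [3, 1, 2, 5, 4]
tau = kendall_tau(x, y)
print(tau)
```

For this sample there are 7 concordant and 3 discordant pairs out of 10, giving τ = 0.4.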

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
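
A bias-corrected Cramér's V of this kind can be sketched in plain Python as follows. This is an illustration under assumptions, not the code used to produce the report: the function name `cramers_v_corrected` is hypothetical, the inputs are two equal-length lists of category labels, and returning 0.0 when the corrected denominator is not positive is a choice made for this example.

```python
from collections import Counter
from math import sqrt

def cramers_v_corrected(a, b):
    """Bias-corrected Cramér's V for two lists of nominal labels."""
    n = len(a)
    joint = Counter(zip(a, b))
    row_totals = Counter(a)
    col_totals = Counter(b)
    r, k = len(row_totals), len(col_totals)

    # Pearson chi-squared statistic of the contingency table.
    chi2 = 0.0
    for row, row_n in row_totals.items():
        for col, col_n in col_totals.items():
            expected = row_n * col_n / n
            observed = joint.get((row, col), 0)
            chi2 += (observed - expected) ** 2 / expected

    # Bias correction of phi-squared and of the table dimensions.
    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    if denom <= 0:
        return 0.0  # assumption for this sketch: treat degenerate tables as no association
    return sqrt(phi2_corr / denom)

# Perfectly associated labels versus independent labels (toy data).
v_perfect = cramers_v_corrected(["x", "y"] * 50, ["p", "q"] * 50)
v_independent = cramers_v_corrected(["x", "x", "y", "y"] * 25, ["p", "q", "p", "q"] * 25)
print(v_perfect, v_independent)
```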

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion by eye.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.
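
The nullity correlation behind the heatmap can be illustrated as the Pearson correlation between per-column presence indicators. The sketch below is a plain-Python approximation with toy data; it assumes missing values are represented as `None`, and the function name `nullity_correlation` and the sample columns are hypothetical.

```python
from math import sqrt

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns."""
    a = [0 if v is None else 1 for v in col_a]
    b = [0 if v is None else 1 for v in col_b]
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((p - ma) * (q - mb) for p, q in zip(a, b))
    va = sum((p - ma) ** 2 for p in a)
    vb = sum((q - mb) ** 2 for q in b)
    if va == 0 or vb == 0:
        return float("nan")  # a fully present or fully missing column has no variation
    return cov / sqrt(va * vb)

# Toy columns: the first two are missing in the same rows, the third is not.
popularity = [77, None, 66, 78, None, 69]
tempo = [95.0, None, 136.9, 120.0, None, 121.5]
valence = [0.9, 0.7, None, 0.5, 0.9, None]

same_pattern = nullity_correlation(popularity, tempo)
different_pattern = nullity_correlation(popularity, valence)
print(same_pattern, different_pattern)
```

A value of 1 means the two columns are always present or missing together; negative values mean one tends to be present when the other is missing.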

Sample

First rows

index | artist | song | duration_ms | explicit | year | popularity | danceability | energy | key | loudness | mode | speechiness | acousticness | instrumentalness | liveness | valence | tempo | genre
0 | Britney Spears | Oops!...I Did It Again | 211160.0 | 0.0 | 2000.0 | 77.0 | 0.751 | 0.834 | 1.0 | -5444.0 | 0.0 | 0.0437 | 0.30000 | 0.000018 | 0.3550 | 0.894 | 95053.0 | pop
1 | blink-182 | All The Small Things | 167066.0 | 0.0 | 1999.0 | 79.0 | 0.434 | 0.897 | 0.0 | -4918.0 | 1.0 | 0.0488 | 0.01030 | 0.000000 | 0.6120 | 0.684 | 148726.0 | rock, pop
2 | Faith Hill | Breathe | 250546.0 | 0.0 | 1999.0 | 66.0 | 0.529 | 0.496 | 7.0 | -9007.0 | 1.0 | 0.0290 | 0.17300 | 0.000000 | 0.2510 | 0.278 | 136859.0 | pop, country
3 | Bon Jovi | It's My Life | 224493.0 | 0.0 | 2000.0 | 78.0 | 0.551 | 0.913 | 0.0 | -4063.0 | 0.0 | 0.0466 | 0.02630 | 0.000013 | 0.3470 | 0.544 | 119992.0 | rock, metal
4 | *NSYNC | Bye Bye Bye | 200560.0 | 0.0 | 2000.0 | NaN | 0.614 | 0.928 | 8.0 | -4806.0 | 0.0 | 0.0516 | 0.04080 | 0.001040 | 0.0845 | 0.879 | 172656.0 | pop
5 | Sisqo | Thong Song | 253733.0 | 1.0 | 1999.0 | 69.0 | 0.706 | 0.888 | 2.0 | -6959.0 | 1.0 | 0.0654 | 0.11900 | 0.000096 | 0.0700 | 0.714 | 121549.0 | hip hop, pop, R&B
6 | Eminem | The Real Slim Shady | 284200.0 | 1.0 | 2000.0 | 86.0 | 0.949 | 0.661 | 5.0 | -4244.0 | 0.0 | 0.0572 | 0.03020 | 0.000000 | 0.0454 | 0.760 | 104504.0 | hip hop
7 | Robbie Williams | Rock DJ | 258560.0 | 0.0 | 2000.0 | 68.0 | 0.708 | 0.772 | 7.0 | -4264.0 | 1.0 | 0.0322 | 0.02670 | 0.000000 | 0.4670 | 0.861 | 103035.0 | pop, rock
8 | Destiny's Child | Say My Name | 271333.0 | 0.0 | 1999.0 | 75.0 | 0.713 | 0.678 | 5.0 | -3525.0 | 0.0 | 0.1020 | 0.27300 | 0.000000 | 0.1490 | 0.734 | 138009.0 | pop, R&B
9 | Modjo | Lady - Hear Me Tonight | 307153.0 | 0.0 | 2001.0 | 77.0 | 0.720 | 0.808 | 6.0 | -5627.0 | 1.0 | 0.0379 | 0.00793 | 0.029300 | 0.0634 | 0.869 | 126041.0 | Dance/Electronic

Last rows

index | artist | song | duration_ms | explicit | year | popularity | danceability | energy | key | loudness | mode | speechiness | acousticness | instrumentalness | liveness | valence | tempo | genre
1990 | Sam Smith | How Do You Sleep? | 202204.0 | 0.0 | 2019.0 | 73.0 | 0.477 | 0.682 | 1.0 | -4931.0 | 0.0 | 0.0925 | 0.1530 | 0.000000 | 0.0763 | 0.345 | 110567.0 | pop
1991 | NSG | Options | 240081.0 | 1.0 | 2020.0 | 57.0 | 0.836 | 0.621 | 1.0 | -4684.0 | 0.0 | 0.0894 | 0.3890 | 0.000092 | 0.1040 | 0.762 | 101993.0 | World/Traditional, hip hop
1992 | Normani | Motivation | 193837.0 | 0.0 | 2019.0 | 71.0 | 0.599 | 0.887 | 4.0 | -3967.0 | 1.0 | 0.0984 | 0.0192 | 0.000001 | 0.3000 | 0.881 | 170918.0 | pop, R&B
1993 | Joel Corry | Sorry | 188640.0 | 0.0 | 2019.0 | 63.0 | 0.744 | 0.790 | 8.0 | -4617.0 | 0.0 | 0.0562 | 0.0547 | 0.000802 | 0.3200 | 0.847 | 125002.0 | pop, Dance/Electronic
1994 | Post Malone | Goodbyes (Feat. Young Thug) | 174960.0 | 1.0 | 2019.0 | 1.0 | 0.580 | 0.653 | 5.0 | -3818.0 | 1.0 | 0.0745 | 0.4470 | 0.000000 | 0.1110 | 0.175 | 150231.0 | hip hop
1995 | Jonas Brothers | Sucker | 181026.0 | 0.0 | 2019.0 | 79.0 | 0.842 | 0.734 | 1.0 | -5065.0 | 0.0 | 0.0588 | 0.0427 | 0.000000 | 0.1060 | 0.952 | 137958.0 | pop
1996 | Taylor Swift | Cruel Summer | 178426.0 | 0.0 | 2019.0 | 78.0 | 0.552 | 0.702 | 9.0 | -5707.0 | 1.0 | 0.1570 | 0.1170 | 0.000021 | 0.1050 | 0.564 | 169994.0 | pop
1997 | Blanco Brown | The Git Up | 200593.0 | 0.0 | 2019.0 | 69.0 | 0.847 | 0.678 | 9.0 | -8635.0 | 1.0 | 0.1090 | 0.0669 | 0.000000 | 0.2740 | 0.811 | NaN | hip hop, country
1998 | Sam Smith | Dancing With A Stranger (with Normani) | 171029.0 | 0.0 | 2019.0 | 75.0 | 0.741 | 0.520 | 8.0 | -7513.0 | 1.0 | 0.0656 | 0.4500 | 0.000002 | 0.2220 | 0.347 | 102998.0 | pop
1999 | Post Malone | Circles | 215280.0 | 0.0 | 2019.0 | 85.0 | 0.695 | 0.762 | 0.0 | -3497.0 | 1.0 | 0.0395 | 0.1920 | 0.002440 | 0.0863 | 0.553 | 120042.0 | hip hop

Duplicate rows

Most frequently occurring

index | duration_ms | explicit | year | popularity | danceability | energy | key | loudness | mode | speechiness | acousticness | instrumentalness | liveness | valence | tempo | genre | # duplicates
0 | 177184.0 | 1.0 | 2016.0 | 76.0 | 0.886 | 0.427 | 6.0 | -10028.0 | 1.0 | 0.1450 | 0.031200 | 0.000990 | 0.0906 | 0.230 | 108034.0 | hip hop | 2
1 | 177685.0 | 1.0 | 2013.0 | 56.0 | 0.760 | 0.652 | 6.0 | -7321.0 | 1.0 | 0.2320 | 0.034800 | 0.000000 | 0.3070 | 0.759 | 100315.0 | hip hop, pop | 2
2 | 184573.0 | 0.0 | 2010.0 | 66.0 | 0.799 | 0.783 | 1.0 | -3896.0 | 0.0 | 0.0322 | 0.034600 | 0.018600 | 0.0757 | 0.586 | 127041.0 | pop | 2
3 | 192190.0 | 0.0 | 2015.0 | 76.0 | 0.688 | 0.702 | 7.0 | -4792.0 | 0.0 | 0.0499 | 0.021500 | 0.000000 | 0.1280 | 0.740 | 94006.0 | hip hop, pop | 2
4 | 193653.0 | 0.0 | 2005.0 | 65.0 | 0.469 | 0.955 | 10.0 | -4253.0 | 1.0 | 0.0432 | 0.000343 | 0.000001 | 0.5480 | 0.462 | 143853.0 | rock, pop | 2
5 | 195200.0 | 0.0 | 2015.0 | 75.0 | 0.526 | 0.862 | 2.0 | -6003.0 | 1.0 | 0.0905 | 0.014400 | 0.059700 | 0.2290 | 0.528 | 90052.0 | hip hop, rock, pop | 2
6 | 198293.0 | 1.0 | 2015.0 | 78.0 | 0.765 | 0.356 | 11.0 | -5556.0 | 0.0 | 0.1950 | 0.223000 | 0.000000 | 0.0963 | 0.189 | 96991.0 | hip hop, pop, R&B | 2
7 | 199480.0 | 0.0 | 2011.0 | 67.0 | 0.707 | 0.861 | 7.0 | -4225.0 | 1.0 | 0.3160 | 0.100000 | 0.000000 | 0.1910 | 0.795 | 130021.0 | hip hop, pop, Dance/Electronic | 2
8 | 200185.0 | 0.0 | 2018.0 | 86.0 | 0.351 | 0.296 | 4.0 | -10109.0 | 0.0 | 0.0333 | 0.934000 | 0.000000 | 0.0950 | 0.120 | 115284.0 | pop, Dance/Electronic | 2
9 | 200786.0 | 0.0 | 2015.0 | 80.0 | 0.654 | 0.760 | 0.0 | -3669.0 | 0.0 | 0.0450 | 0.079700 | 0.000000 | 0.2990 | 0.410 | 99945.0 | pop | 2